


PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 



(51) International Patent Classification:




(11) International Publication Number: 


WO 00/34317 


C07K 14/435, 14/315, 16/46, G01N 33/563


A2 


(43) International Publication Date: 


15 June 2000 (15.06.00)



(21) International Application Number: PCT/GB99/04119

(22) International Filing Date: 8 December 1999 (08.12.99)



(30) Priority Data: 

9826925.1 
9902139.6 

(71) Applicant (for all designated States except US): BIOVATION
LIMITED [GB/GB]; Investment House, 6 Union Row,
Aberdeen AB10 1DQ (GB).

(72) Inventors; and 

(75) Inventors/Applicants (for US only): CARR, Francis, Joseph
[GB/GB]; Biovation Limited, Crombie Lodge, Aberdeen
Science Park, Balgownie Drive, Aberdeen AB22 8GU (GB).
ADAIR, Fiona, Suzanne [GB/GB]; Biovation Limited,
Crombie Lodge, Aberdeen Science Park, Balgownie Drive,
Aberdeen AB22 8GU (GB). HAMILTON, Anita, Anne
[GB/GB]; Biovation Limited, Crombie Lodge, Aberdeen
Science Park, Balgownie Drive, Aberdeen AB22 8GU (GB).
CARTER, Graham [GB/GB]; Biovation Limited, Crombie
Lodge, Aberdeen Science Park, Balgownie Drive, Aberdeen
AB22 8GU (GB).



(74) Agents: SHEARD, Andrew, Gregory et al.; Kilburn & Strode,
20 Red Lion Street, London WC1R 4PJ (GB).



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR,
BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, EE, ES, FI,
GB, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG,
KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, MD, MG,
MK, MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE,
SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG, US, UZ,
VN, YU, ZA, ZW, ARIPO patent (GH, GM, KE, LS, MW,
SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM, AZ, BY,
KG, KZ, MD, RU, TJ, TM), European patent (AT, BE, CH,
CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL,
PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN,
GW, ML, MR, NE, SN, TD, TG).



Published

Without international search report and to be republished
upon receipt of that report.



(54) Title: MODIFYING PROTEIN IMMUNOGENICITY
(57) Abstract

It is known that proteins, or parts of them, may be rendered non- or less immunogenic to humans or other species by identifying
one or more potential T cell epitopes and eliminating them by amino acid modification. Conventionally, certain epitopes may be retained
in a protein sequence if the peptides constituting such epitopes are present in endogenous human protein, since they would be recognised
as "self". However, it has now been found that even self epitopes may give rise to immune reactions. The invention provides for their
elimination, for example by recombinant DNA technology, to render the proteins more useful for administration to humans, for example for
therapeutic or diagnostic purposes.







PCT/GB99/04119



MODIFYING PROTEIN IMMUNOGENICITY 

THE PRESENT INVENTION relates to proteins to be administered, especially to
humans, particularly for therapeutic use but also for use in diagnostic tests. The
invention particularly provides for proteins which are modified to be less
immunogenic than the unmodified counterpart when used in vivo.

The invention particularly addresses the clinical need for a process by which the
natural tendency of the host to mount an immune response against an administered
protein is substantially reduced or eliminated. There are several examples where the
administration of protein molecules offers therapeutic benefit. However, the benefit is
greatly reduced, particularly where multiple doses of the therapeutic protein are
required, as the recipient immune system recognises and accelerates the elimination of
the incoming therapeutic protein.

There are a number of therapeutic proteins whose therapeutic use is curtailed on
account of their immunogenicity in man. For example, when murine antibodies are
administered to patients who are not immunosuppressed, a majority of patients mount
an immune reaction to the foreign material by making human anti-murine antibodies
(HAMA). There are two serious consequences. First, the patient's anti-murine
antibody may bind and clear the therapeutic antibody or immunoconjugate before it
has a chance to bind to the tumour and perform its function. Second, the patient may
develop an allergic sensitivity to the murine antibody and be at risk of anaphylactic
shock upon any future exposure to murine immunoglobulin.

Several techniques have been employed to address the HAMA problem and thus
enable the use in humans of therapeutic monoclonal antibodies (see, for example,
WO-A-8909622, EP-A-0239400, EP-A-0438310, WO-A-9109967). A common aspect of
these methodologies has been the introduction into the therapeutic antibody, which in
general is of rodent origin, of significant tracts of sequence identical to that present in
human antibody proteins. Such alterations are also usually coupled to alteration of
particular single amino acid residues at positions considered critical to maintaining the
antibody-antigen binding interaction. For antibodies, this process is possible owing to
the very high degree of structural (and functional) conservatism between antibody
molecules of different species. However, for potentially therapeutic proteins where no
structural homologue may exist in the host species (e.g. human) for the therapeutic
protein, such processes are not applicable. Furthermore, these methods have assumed
that the general introduction of human sequence will render the re-modelled antibody
non-immunogenic. It is known, however, that certain short peptide sequences ("T cell
epitopes") can be released during the degradation of proteins within cells and
subsequently presented by molecules of the major histocompatibility complex (MHC)
in order to trigger the activation of T cells. For peptides presented by MHC class II,
such activation of T cells can then give rise to an antibody response by direct
stimulation of B cells to produce such antibodies. None of the previous methods have
addressed the elimination or avoidance of such epitopes in the final therapeutic
molecule as a means to reduce or eliminate the antibody response against proteins.
Nor have previous methods considered the elimination of peptides presented by MHC
class I which can trigger a cytotoxic T cell response leading to cell killing. Such
killing of cells causes the release of cellular components including proteins which can,
in turn, activate specialist cells which are highly active in protein processing and MHC
presentation and also in release of inflammatory cytokines as a result of such
activation. As a result, an inflammatory environment can be created which promotes
the more active uptake and processing of the protein for therapeutic use, thus
facilitating induction of an antibody response against the protein.

The elimination of T cell epitopes from proteins has been previously disclosed (see
WO-A-9852976) whereby such potential T cell epitopes are defined as any peptide
[...] potential epitopes are measured by any computational or physical method to establish
MHC binding. Implicit in the term "T cell epitope" is an epitope which is recognised
by the T cell receptor and which can at least in principle cause the activation of these
T cells. It is however usually understood that certain peptides which are found to bind
to MHC class II molecules may be retained in a protein sequence because such
peptides are recognised as "self" within the organism into which the final protein is
administered.

In practice, soluble proteins introduced into autologous organisms do sometimes
trigger an immune response resulting in development of host antibodies which bind to
the soluble protein. One example is interferon alpha 2, to which a percentage of human
patients make antibodies despite the fact that this protein is produced endogenously.

The present invention is based on the discovery by the inventors that MHC-binding
peptides within autologous proteins can trigger immune responses in live organisms to
those proteins even when the specific protein is endogenously produced. A possible
explanation for this is that the doses of such proteins administered are much higher
than normal, providing MHC-peptide formation which can activate T cells.
Alternatively, the physiological environment into which such administered proteins
are presented is one favourable to efficient antigen processing and peptide-MHC
formation at higher levels than usually encountered, for example in inflammatory
situations.

According to a first aspect of the invention, there is provided a method of rendering a
protein, or part of a protein, non-immunogenic, or less immunogenic, to a given
species, the method comprising:

(a) determining at least part of the amino acid sequence of the protein;

(b) identifying in the amino acid sequence one or more potential epitopes for T
cells ("T cell epitopes") which are found in an endogenous protein of the given
species; and

(c) modifying the amino acid sequence to eliminate at least one of the T cell
epitopes identified in step (b) thereby to reduce the immunogenicity of the protein or
part thereof when exposed to the immune system of the given species.
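
Steps (b) and (c) above can be illustrated with a minimal sketch. It assumes, purely for illustration, that candidate epitopes are fixed-length windows (nine residues here) and that the endogenous proteins of the given species are available as plain sequence strings. The function names and the toy sequences are hypothetical and are not taken from this specification, and the sketch ignores MHC binding, which a real analysis would assess first by any of the methods described herein.

```python
def peptide_windows(sequence, length=9):
    """Yield (start_index, peptide) for every window of the given length."""
    for start in range(len(sequence) - length + 1):
        yield start, sequence[start:start + length]


def shared_with_endogenous(protein, endogenous_proteins, length=9):
    """Return windows of `protein` that also occur in an endogenous protein.

    Hypothetical stand-in for step (b); it does not model MHC binding.
    """
    endogenous_peptides = set()
    for seq in endogenous_proteins:
        for _, peptide in peptide_windows(seq, length):
            endogenous_peptides.add(peptide)
    return [(start, pep) for start, pep in peptide_windows(protein, length)
            if pep in endogenous_peptides]


# Toy sequences, invented for illustration only.
therapeutic = "MKTAYIAKQRQISFVKSHFSRQ"
endogenous = ["GGGAYIAKQRQISGGG"]
hits = shared_with_endogenous(therapeutic, endogenous)
for start, peptide in hits:
    print(start, peptide)
```

Each reported window is a candidate for modification in step (c). Under the approach described here it would not be exempted merely because it also occurs in an endogenous protein.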

The given species will usually be human. In step (c), one or more actual epitopes or 
potential epitopes may be eliminated. 


Therefore, the present invention provides a modified method for creating proteins
which trigger a reduced or absent immune response whereby one or more MHC
binding peptides which are also found in the autologous organism's endogenous
proteins are modified to reduce or eliminate binding to MHC molecules. Thus, in
practice, no assumption is made about which peptides might be tolerated by the
organism (unless data is available which indicates that tolerance is in place). Overall,
the invention encompasses the creation of protein molecules with reduced or absent
immunogenicity in live organisms where one or more peptides which bind to MHC
molecules are eliminated from the protein molecule. As such, the invention is an
improvement on the prior art which focuses only on the removal of potential T cell
epitopes from proteins with the assumption that the organism is tolerant to self
proteins or peptide sequences. In particular, the invention includes a range of human
and non-human proteins which are modified by modification of one or more MHC
binding peptides within the sequences. Such MHC binding peptides can include
peptides which bind to MHC class II molecules and/or peptides which bind to MHC
class I.



The aspect of the invention described above is preferred to the alternative strategy of
separately immunising the organism with such peptides in order to induce tolerance or
the alternative of administering analogues or fragments of the protein which resist
binding or uptake into cells (especially antigen-presenting cells), for example by
elimination of the cell binding site in the protein. Tolerising the organism to the
peptide in this way (e.g. the use of MHC binding peptides identified from a syngeneic
protein which can be used to induce tolerance in the host prior to administration of the
entire protein) would require two therapeutic molecules, one to perform the main
protein function and the other to induce the tolerance, which would not be favoured
from the regulatory point of view.

Any method of identifying - in which term is included predicting - one or more
potential T cell epitopes can be used in the invention. Acceptable methods may be
computational or physical and include measuring of the binding of MHC-peptide
complexes to T cell receptors, the testing of potential T cell epitopes in transgenic
mice expressing human MHC molecules, the testing of potential T cell epitopes in
mice reconstituted with human antigen-presenting cells and T cells in place of their
endogenous cells, and the testing of potential T cell epitopes for stimulation of T cells
in vitro via presentation of peptides on MHC molecules using syngeneic
antigen-presenting cells.

As well as identifying in the amino acid sequence one or more potential T cell
epitopes which are found in an endogenous protein of the given species, a method of
the invention may, and typically will, involve additionally identifying and eliminating
one or more potential T cell epitopes which are not found in an endogenous protein of
the given species, since such epitopes are even more likely to be immunogenic.

The present invention is based on a new concept in development of therapeutic
proteins. It has been recognised previously that, with exogenous T cell dependent
proteins, helper T cell epitopes are important to development of a significant immune
response to the autologous protein. Such T cell epitopes function through the internal
processing of the protein releasing peptides which complex with MHC class II
molecules and which are then presented on the surface of appropriate cells for
potential binding to receptors on T cells. However, it is very difficult to predict which
peptides within the protein will be processed appropriately such that MHC binding can
occur and it is also a factor that many processed peptides will not bind to all allotypic
variants of MHC class II molecules (MHC restriction). Furthermore, in any given
living organism, it is difficult to predict whether a T cell response will actually be
triggered by a given peptide presented on MHC class II because T cells may have been
tolerised to such epitopes or may not have the repertoire of T cell receptors to bind to
the MHC class II-peptide complex. Due to these complicating factors, it has
previously been impractical to develop proteins with reduced or absent
immunogenicity because of the difficulty in predicting actual T cell epitopes. The
invention herein primarily takes the new approach of creating improved therapeutic
proteins by removal of potential rather than actual T cell epitopes (usually defined by
binding to MHC molecules) such that certain peptides within the molecule which are
not actually immunogenic may be altered in addition to those immunogenic peptides.
For a therapeutic molecule, preferably all of the potential T cell epitopes are removed
whilst retaining activity of the protein. Preferably, this involves judicious choice of
the amino acid substitutions enabling removal of the T cell epitopes and usually will
involve the testing of a range of variant molecules with different amino acid
substitutions.

Not all identified potential T cell epitopes which are found in an endogenous protein
need be eliminated. For example, immune responses in general are not mounted to
autologous circulating proteins, such as immunoglobulins and other serum proteins
such as serum albumin. So potential T cell epitopes which are found, for example, in
germline immunoglobulin variable region protein sequences of the given species
(which will generally be human) may sometimes be ignored. However, an epitope
considered as not generally available to the immune system to gain tolerance may be
identified as a potential epitope for elimination according to the method of the present
invention. Examples of such epitopes include those of intracellular proteins,
such as nuclear proteins and integral membrane proteins.

The invention therefore provides proteins which have been altered to reduce their
immunogenicity as well as a general method of altering such proteins to reduce their
immunogenicity. A major principle of the present invention is that proteins are altered
by identification of potential T cell epitopes and their subsequent alteration within the
proteins in order to eliminate such potential epitopes. Optionally, epitopes recognised
by host antibodies ("B cell epitopes") can also be removed if these can be identified,
for example with the aid of patient antisera, or if the surface residues in a
non-autologous protein can be altered to those of a related autologous protein endogenous
to the host organism without losing all of the desired activity of the non-autologous
molecule.

After elimination of one or more T cell epitopes, the protein may be tested for desired
activity. It may retain its full activity, or it may retain a sufficient proportion of its
original activity to be useful. In some circumstances, the activity may be altered either
beneficially or at least in an acceptable way. Proteins entirely lacking in useful
function post modification may be discarded.

Proteins of the present invention include those which have potential clinical use in
humans or veterinary use in animals, especially where such uses involve multiple
administrations of the proteins. They may be non-autologous or autologous. Proteins
of the present invention include proteins with an enzymatic activity which has a
beneficial therapeutic effect, proteins which are used to convert inactive drugs to
active drugs within living organisms, proteins which are used to vaccinate whereby
certain immunogenic epitopes are undesirable, proteins which perform as carriers of
other molecules within the living organism and proteins which bind to other molecules
[...] the other molecules. Amongst the examples of non-autologous proteins with potential
benefit for in vivo human use are the following: thrombolytic proteins streptokinase
and staphylokinase; pro-drug activating enzymes such as asparaginase and
carboxypeptidase G2; toxins of plant, fungal and bacterial origin such as ricin,
saponin, pokeweed antiviral protein, mistletoe lectin, cholera toxin, pertussis toxin,
diphtheria toxin; bacterial phospholipase C (e.g. PE33, PE35, PE37, PE38, PE40);
ribotoxins such as alpha-sarcin, clavin, mitogillin, restrictocin and bryodin-1 and 2.
Other significant non-autologous molecules of proven and potential therapeutic benefit
include streptavidin, cobra venom factor, insulin, collagen, non-autologous antibodies,
non-autologous MHC molecules and non-autologous T cell receptor molecules.
Autologous proteins of proven and potential therapeutic value include IL-2, interferon
α and β, granulocyte macrophage colony stimulating factor (GMCSF), granulocyte
colony stimulating factor (GCSF), tissue plasminogen activator (t-PA), insulin, Factor
VIII, Factor IX, erythropoietin (EPO), pituitary growth hormone and megakaryocyte
growth and development factor (MGDF) and derivatives of these proteins produced by
recombinant methods. Other examples of proteins include recombinant molecules
derived from or comprised of single or multiple domains from autologous or
non-autologous proteins, recombinant proteins engineered to change or modify function or
protein sequences derived from any of the above molecules.

A typical protocol within the general method of the present invention comprises the
following steps:

I. Determining the amino acid sequence of the protein or a part thereof (if
modification only of a part is required);

II. Identifying potential T cell epitopes within the amino acid sequence of the
protein by any method including determination of the binding of peptides to
MHC molecules, determination of the binding of peptide:MHC complexes to
the T cell receptors from the species to receive the therapeutic protein, testing
of the protein or peptide parts thereof using transgenic animals with the MHC
molecules of the species to receive the therapeutic protein, or testing with
transgenic animals reconstituted with immune system cells from the species to
receive the therapeutic protein;

III. By genetic engineering or other methods for producing modified proteins,
altering the protein to remove one or more of the potential T cell epitopes and
producing such an altered protein for testing;

IV. (optionally) Within step III., altering the protein to remove one or more of the
potential B cell epitopes;

V. Testing altered proteins with one or more potential T cell epitopes (and
optionally B cell epitopes) removed in order to identify a modified protein
which has retained all or part of its desired activity but which has lost one or
more T cell epitopes.



Potential T-cell epitopes herein are defined as specific peptide sequences which either
bind with reasonable efficiency to MHC class II molecules (or their equivalent in a
non-human species), or which in the form of peptide:MHC complexes bind strongly to
the T cell receptors from the species to receive the therapeutic protein or which, from
previous or other studies, show the ability to stimulate T-cells via presentation on
MHC class II. The method of the present invention recognises that an effective
T cell-dependent immune response to a foreign protein requires activation of the cellular arm
of the immune system. Such a response requires the uptake of the therapeutic
(foreign) protein by antigen presenting cells (APCs). Once inside such cells, the
protein is processed and fragments of the protein form a complex with MHC class II
molecules and are presented at the cell surface. Should such a complex be recognised
by binding of the T cell receptor from T-cells, such cells can be, under certain
conditions, activated to produce stimulatory cytokines. The cytokines will elicit
differentiation of B-cells to full antibody producing cells. In addition, such T cell
responses may also mediate other deleterious effects on the patient such as
inflammation and possible allergic reaction.

It is understood that not all peptide sequences will be delivered into the correct MHC
class II cellular compartment for MHC class II binding or will be suitably released
from a larger cellular protein for subsequent MHC class II binding. It is further
understood that even such peptides which are presented by MHC class II on the
surface of APCs may not elicit a T-cell response for reasons including a lack of the
appropriate T-cell specificity, tolerance by the immune system to the particular peptide
sequence or the low affinity of the MHC-peptide complex for T cell receptor.

The present invention provides for removal of human (or other given species) potential
T-cell epitopes from the therapeutic protein whereby the primary sequence of the
therapeutic protein can be analysed for the presence of MHC class II binding motifs by
any suitable means. For example, a comparison may be made with databases of
MHC-binding motifs, such as by searching the "motifs" database at the world-wide
web site http://wehih.wehi.edu.au/mhcpep/ (formerly wehil.wehi.edu.au or
wehi.wehi.edu.au). Alternatively, MHC class II binding peptides may be identified
using computational threading methods such as those devised by Altuvia et al. (J.
Mol. Biol. 249 244-250 (1995)). Similar methods may be used to identify and
remove epitopes which bind to MHC class I.
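
The comparison of a primary sequence against a binding motif can be sketched as follows. This is only an illustration: the motif shown (a large hydrophobic residue at position 1 and a small residue at position 6 of a nine-residue window) is a hypothetical pattern invented for the example, not one drawn from the database mentioned above, and the function and variable names are likewise assumptions. A regular-expression lookahead is used so that overlapping matches are all reported.

```python
import re

# Hypothetical anchor motif, invented for illustration: a large hydrophobic
# residue at position 1 and a small residue at position 6 of a 9-mer.
EXAMPLE_MOTIF = re.compile(r"(?=([FILMVWY].{4}[AGST].{3}))")


def find_motif_matches(sequence, motif=EXAMPLE_MOTIF):
    """Return (start_index, peptide) for every, possibly overlapping, match."""
    return [(m.start(), m.group(1)) for m in motif.finditer(sequence)]


# Toy sequence, invented for illustration only.
matches = find_motif_matches("KKLAAAASKKKFKKKKGKKK")
for start, peptide in matches:
    print(start, peptide)
```

In practice one pattern per MHC allotype of interest would be scanned, and the union of matches taken as the set of potential epitopes.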

Having identified potential given species (e.g. human) T-cell epitopes, these epitopes
are then eliminated by alteration of one or more amino acids, as required to eliminate
the T-cell epitope. Usually this will involve alteration of one or more amino acids
within the T-cell epitope itself. It could also involve altering an amino acid adjacent to
the epitope in terms of the primary structure of the protein, or one which is not adjacent
in the primary structure but is adjacent in the secondary structure of the molecule. The
usual alteration contemplated will be amino acid substitution, but in certain
circumstances amino acid deletion or addition will be appropriate. In some instances,
selection of the appropriate amino acid substitution at any particular point in the
therapeutic protein primary structure can be made with reference to an homologous
protein primary structure. This is particularly the case where a repertoire of
homologous genes exists, as for example with immunoglobulins, or a single
homologous gene for the therapeutic protein exists, as is the case for the cobra venom
factor protein and human complement factor C3 detailed in Example 1. However, in
other cases, there may not be an homologous protein to the therapeutic protein, as is the
case for potential therapeutics such as the thrombolytic agent staphylokinase. In such
cases amino acids may be selected on the basis of similar size and/or charge or, more
preferably and where structural data for the protein exist, in combination with or with
reference to in silico protein modelling techniques. In some instances amino acid
substitutions may be made to prevent proteolytic cleavage of a protein and thereby
inhibit binding of a peptide to MHC class II. Examples of protease cleavage sites
include those reported by van Noort & van der Drift (J. Biol. Chem. 264 14159-14164
(1989)) and van Noort et al. (Eur. J. Immunol. 21 1989-1996 (1991)). Examples of
amino acids which prevent protease digestion have been identified, e.g. Kropshofer et
al., J. Immunol. 151 4732-4742 (1993).
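
Where no homologous protein is available, the selection of replacement residues of similar size and/or charge might be sketched as below. The grouping of residues into classes is an assumption chosen for illustration (the specification does not give one), and the helper names are hypothetical. The sketch only proposes candidates; whether a given substitution actually removes the epitope and preserves activity still has to be tested.

```python
# Coarse physicochemical classes. This particular grouping is an assumption
# made for illustration, not one given in the specification.
RESIDUE_CLASSES = [
    set("DE"),     # acidic
    set("KRH"),    # basic
    set("STNQ"),   # polar, uncharged
    set("AVLIM"),  # aliphatic
    set("FWY"),    # aromatic
    set("G"), set("P"), set("C"),  # treated as special cases
]


def similar_residues(residue):
    """Return the other residues in the same class, in alphabetical order."""
    for residue_class in RESIDUE_CLASSES:
        if residue in residue_class:
            return sorted(residue_class - {residue})
    raise ValueError(f"unknown residue: {residue!r}")


def candidate_variants(sequence, position):
    """Return sequences with a same-class substitution at `position`."""
    return [sequence[:position] + new + sequence[position + 1:]
            for new in similar_residues(sequence[position])]


# Toy peptide, invented for illustration only.
variants = candidate_variants("ALKDE", 2)
print(variants)
```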

In the method of the present invention, usually a number of variants of altered proteins
will be produced and tested for retention of the desired activity. Where it is desirable
to maximise the removal of potential T cell epitopes from the protein, it will be
common to create a range of variants including some with potential T cell epitopes
remaining in the molecule. It will be recognised that with certain protein molecules, it
is difficult to radically alter the molecule and to retain full activity and so judicious use
of molecular modelling of variants and the testing of a maximum number of variants is
desirable. For molecular modelling, standard commercially available software
packages can be used to model the protein structure using as starting point either a
crystal structure or a model built from homology or protein folding prediction.
Information on parts of the protein involved in imparting the molecule with its activity
will assist in modelling variants, enabling best choice of altered amino acids which
remove a potential T cell epitope (or optional B cell epitope) whilst retaining activity.
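
The creation of a range of variants, including some in which potential epitopes remain, can be sketched as an enumeration over subsets of candidate substitutions. This is a minimal illustration under stated assumptions: each substitution is represented as a hypothetical (zero-based position, new residue) pair, the sequence and substitutions are invented, and the function names are not taken from this specification.

```python
from itertools import combinations


def apply_substitutions(sequence, substitutions):
    """Return `sequence` with each (position, new_residue) pair applied."""
    residues = list(sequence)
    for position, new_residue in substitutions:
        residues[position] = new_residue
    return "".join(residues)


def variant_library(sequence, substitutions):
    """Return (subset, variant) pairs for every non-empty subset of
    substitutions, so that partially modified variants are included."""
    library = []
    for size in range(1, len(substitutions) + 1):
        for subset in combinations(substitutions, size):
            library.append((subset, apply_substitutions(sequence, subset)))
    return library


# Toy sequence and substitutions, invented for illustration only.
library = variant_library("MKLVFA", [(2, "A"), (4, "S")])
for subset, variant in library:
    print(subset, variant)
```

The number of variants grows as 2^n - 1 for n candidate substitutions, which is one reason a high-throughput screen becomes desirable.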

In the method of the present invention where a number of variants of altered proteins
will be produced and tested for retention of the desired activity, it is desirable to have
a high-throughput method for screening large numbers of variants. Such methods
include in vivo methods for expression of gene variants in cells such as bacteria with
rapid isolation of the proteins, for example using an antibody to a conserved region of
the proteins, or via a protein purification tag included in the protein sequence. As an
alternative, in vitro methods such as in vitro transcription and translation can be used
in some cases. Where the protein binds to another molecule, display methods such as
phage display and ribosome display can be used in order to select out variants which
retain binding activity.

In the practice of the present invention, specific amino acid alterations can be
conducted by recombinant DNA technology, so that the final molecule may be
prepared by expression from a recombinant host using established methods. However,
the use of protein chemistry or any other means of molecular alteration is not excluded
in the practice of the invention. Whilst the present invention provides a method for
removal of potential T cell epitopes (and optional B cell epitopes) from protein
molecules, the method does not exclude testing of variant molecules produced within
the invention for actual T cell epitope activity including testing of the protein or
peptide parts thereof using transgenic animals with the MHC molecules of the species
to receive the therapeutic protein, or testing with transgenic animals reconstituted with
immune system cells from the species to receive the therapeutic protein.

Whilst such methods are not yet routine, in vitro T cell assays can be undertaken
whereby the protein can be processed and presented on MHC class II by appropriate
antigen-presenting cells (APCs) to syngeneic T cells. T cell responses can be
measured by simple proliferation measurements (especially if the APCs have been
irradiated or otherwise treated to prevent proliferation) or by measuring specific
cytokine release. In order to account for different MHC class II allotypes, a range of
in vivo assays will usually be required in order to broadly test for T cell epitopes.
Alternatively, transgenic animals equipped with human (or the desired species) MHC
class II molecules could be used to test for T cell epitopes, especially where the host
MHC class II repertoire has been removed and especially where one or more other
host accessory molecules in APC/T cell interaction have also been replaced with
human (or the desired species) such as CD4 on T cells.

In a further aspect, the invention provides molecules resulting from a method as
described above. Such molecules may be useful for treating or preventing a disease or
condition. The invention also extends to the use of such molecules in in vivo and in
vitro diagnosis and for human or veterinary use. Preferred features of each aspect of
the invention are as for each other aspect, mutatis mutandis.

The invention is illustrated but not limited by the following examples. The examples
refer to the following figures:



FIGURE 1 shows the protein sequence of mature cobra venom factor;

FIGURE 2 shows the protein sequence of an altered cobra venom factor [...]-chain;

FIGURE 3 shows the protein sequence of an altered cobra venom factor [...]-chain;

FIGURE 4 shows the protein sequence of an altered cobra venom factor [...]-chain;

FIGURE 5 shows the protein sequence of streptokinase from Streptococcus equisimilis;

FIGURE 6 shows the protein sequence of an altered streptokinase molecule;

FIGURE 7 shows the protein sequence of mature staphylokinase;

FIGURE 8 shows the protein sequence of an altered staphylokinase molecule;

FIGURE 9 shows the protein sequence of wild-type bryodin 1;

FIGURE 10 shows the protein sequence of an altered bryodin 1 molecule;

FIGURE 11 shows the protein sequence of mature human interferon α2; and

FIGURE 12 shows the protein sequence of an altered human interferon α2
molecule.

Example 1 

In the present example the cobra venom factor (CVF) protein is analysed for the
presence of potential MHC binding motifs and a method disclosed for the removal of a
number of these from the molecule.

CVF is the non-toxic complement-activating glycoprotein in cobra venom. The
mature CVF protein consists of three polypeptide chains α, β and γ, produced by
post-translational processing of a pre-pro protein of approximate molecular weight 150 kDa
(Fritzinger D.C. et al., Proc. Nat'l. Acad. Sci. USA 91 12775-12779 (1994)). The
α and β chains are both linked to the γ chain via single disulphide bridges. In addition,
both the α chain and the γ chain have single intra-chain disulphide bonds whilst the β
chain has a complex arrangement of six further internal disulphide bonds. The mature
protein has four potential sites for N-glycosylation of which only three are used
(Vogel, C.W. & Muller-Eberhard J. Immunol. 18 125-133 (1984)).

In the preferred embodiment of the present invention, production of the reduced 
immunogenicity or altered version of the therapeutic protein is by recombinant DNA 
technology. For the present example, it is understood that the expedient of 
recombinant production of the mature CVF protein alone is likely to provide a 
considerable reduction in immunogenicity of the protein. This assertion stems from 
the knowledge that a degree of the immunogenicity arising from administration of 
CVF purified from Naja naja venom, arises from antibody responses to the particular
pattern of glycosylation not encountered in mammalian proteins (Gowda, D.C et al, J.
Immunol. 152 2977-2986 (1994)). It is also understood that deglycosylated and also
sialylated CVF that would arise from recombinant production of CVF, for example in
a mammalian producer cell system, have equal functional activity with native mature
CVF (Gowda, D.C et al. (1994) ibid.). 

In humans CVF triggers the alternative (properdin) pathway of complement activation.
CVF forms a bimolecular complex with factor B which is cleaved by factor D into Bb
which remains bound to CVF to form the C3/C5 convertase CVFBb. This complex is
functionally and structurally analogous to the human C3/C5 convertase formed during
the activation of the alternative pathway. The clinical importance of CVF lies in the
high relative stability of the CVFBb complex versus the native C3/C5 convertase (t1/2
of 7 hours versus t1/2 of 90 seconds) and in particular, the inability of CVFBb to be
disrupted and hence down regulated by factors H and I (Gowda, D.C et al, (1994)
ibid.). The net result of CVF administration therefore is to lead to the depletion of
complement in human serum by continuous efficient and uncontrolled complement
activation.

This property has been exploited as a research tool for in vivo and in vitro
de-complementation (Cochrane C.G. et al., J. Immunol. 105 55 (1970)) and has also been
exploited for the selective elimination of tumour cells as a conjugate with a tumour
targeting monoclonal antibody (Vogel, C.-W. & Muller-Eberhard H.J. Proc. Nat'l.
Acad. Sci. USA 78 7707 (1981)). Availability of a non-immunogenic CVF would have
considerable importance as a potential therapeutic agent for the depletion of
complement in the plasma of patients undergoing organ xenotransplantation or in the
construction of second generation antibody conjugates for targeted activation of
complement in cancer patients.

Method for identification of potential MHC class II binding motifs in cobra 
Venom Factor: 

The sequence of CVF was identified from the GenBank database. The sequence with
accession number U09969 was used throughout. The protein sequences of the mature
α, β and γ chains were identified (Figure 1) and analysed individually for the presence
of potential MHC class II binding motifs. Analysis was conducted by computer aided
comparison to a database of MHC-binding motifs as resident on world wide web site
http://wehih.wehi.edu.au/mhcpep/.

Results of the "searching" process on the α chain indicate the presence of 638
potential MHC class II binding motifs. Of these, 525 matched sequences identified in
a database of human germline immunoglobulin variable region protein sequences.
These epitopes were not considered further on the basis that immune responses in
general are not mounted to autologous circulating proteins such as immunoglobulins.
This implies immunological tolerance to the potential T-cell epitopes present in the
structure of the immunoglobulins (and indeed the majority of human proteins).
Epitopes presented by non-autologous proteins such as CVF which are identical or
similar to motifs present in immunoglobulin proteins are likely also to be tolerated and
in practice may be retained through the de-immunisation process.
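
The subtraction step described above amounts to a set difference between the candidate motifs and the motifs found in the germline immunoglobulin database. A minimal sketch in Python follows, assuming each motif is held as a plain peptide string. The function name `subtract_tolerated` and the toy peptides are hypothetical and are not taken from the specification, and only exact identity is modelled, not the "similar" matches the text also allows.

```python
def subtract_tolerated(candidates, germline_motifs):
    """Return the candidates that do not match any tolerated (self) motif.

    Exact string identity is used for the match; similarity-based
    matching is not modelled in this sketch.
    """
    tolerated = set(germline_motifs)
    # Preserve the order in which the candidates were found in the protein.
    return [motif for motif in candidates if motif not in tolerated]


# Toy data (hypothetical peptides, for illustration only).
candidates = ["AAAKLLV", "FKPGMPY", "WVRQAPG", "YYQVGNN"]
germline = ["WVRQAPG", "AAAKLLV"]

remaining = subtract_tolerated(candidates, germline)
print(remaining)
```

Only the motifs left in `remaining` would go forward to the second comparison against non-immunoglobulin human proteins.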

Following subtraction of the human immunoglobulin protein germline motifs, the
remaining 113 potential epitopes in the α chain were analysed individually for
similarity to non-immunoglobulin protein sequences. In practice, predicted anchor
residues for each potential epitope were used in a consensus sequence search of human
expressed proteins. The SwissProt and GenBank translated sequence databases were
interrogated using commercially available software (DNAstar, Madison, WI, USA).
Epitopes identified in known circulating human proteins were not considered further
and were therefore allowed to remain unchanged within the CVF α chain. An
example of one such rejected potential epitope is given by the sequence FKPGMPY at
positions 332-338 in the CVF α chain (numbering from the first amino acid of the mature
α chain). This sequence represents a predicted consensus binding motif for
HLA-DRB1*0301 with anchor residues F, M and Y. Database searching using the
consensus sequence FxxxMxY identifies only a single entry in a human protein sub-set
of the SwissProt database, corresponding to human serum biotinidase (SwissProt
accession # A54362). An example of an epitope where no match to a human protein
considered to be in the general circulation was found is provided by the sequence
YYQVGNNEL at positions 501-509 in the CVF α chain. This sequence represents a
potential epitope for presentation by HLA-DRB1*0101. Consensus sequence
searching identifies only four human proteins containing this motif, of which three are
nuclear proteins of differentiated tissues such as brain, and the fourth is an integral
membrane protein. These may be considered as not generally available to the immune
system to gain tolerance and therefore identify this as a potential epitope for
elimination according to the method of the present invention. Similarly, a further
potential HLA-DR1 binding motif was identified in the α chain peptide sequence
YVWQVTGE at positions 81-89. This motif identifies a single human nuclear
protein, DNA polymerase alpha (GenBank accession # M64481), in the same data set
and was also therefore identified for modification by the method of the present
invention. Following these processes, a total of 15 potential epitopes in the α chain
were considered for removal by amino acid substitution.
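
The consensus sequence search can be illustrated with a regular expression in which each "x" matches any residue. The Python sketch below is an assumption about how such a search might be expressed and is not the DNAstar procedure used in the specification. The helper names and the two toy protein fragments are hypothetical; only the motif FxxxMxY and the peptide FKPGMPY come from the text.

```python
import re


def consensus_to_regex(consensus):
    """Convert a consensus such as 'FxxxMxY' to a compiled regex.

    Lower-case 'x' stands for any residue; every other character is
    treated as a fixed anchor residue.
    """
    pattern = "".join("." if ch == "x" else re.escape(ch) for ch in consensus)
    return re.compile(pattern)


def find_motif(consensus, sequence):
    """Return 1-based (start, end, match) tuples, overlaps included."""
    regex = consensus_to_regex(consensus)
    hits = []
    # A lookahead lets overlapping matches be reported.
    for m in re.finditer("(?=(" + regex.pattern + "))", sequence):
        start = m.start() + 1
        hits.append((start, start + len(consensus) - 1, m.group(1)))
    return hits


# The alpha-chain peptide quoted in the text matches its own consensus.
self_hit = find_motif("FxxxMxY", "FKPGMPY")
print(self_hit)

# Hypothetical protein fragments standing in for a sequence database.
database = {
    "toy_protein_1": "MSTFAAAMGYKK",
    "toy_protein_2": "MKKLLPQRS",
}
matches = {name: find_motif("FxxxMxY", seq) for name, seq in database.items()}
print(matches)
```

The number of database entries with a non-empty hit list corresponds to the count of human proteins "identified" by a consensus in the text.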

Results of the "searching" process on the β chain indicate the presence of 366
potential MHC class II binding motifs. Of these, 281 matched sequences identified in
a database of human germline immunoglobulin variable region protein sequences and
were not considered further. Following a second round of subtractions using a
database of human serum protein sequences (as above), 18 potential epitopes were
considered for removal.

Results of the "searching" process on the γ chain indicate the presence of 267
potential MHC class II binding motifs. Of these, 219 matched sequences identified in
a database of human germline immunoglobulin variable region protein sequences and
were not considered further. Following a second round of subtractions using a
database of human serum protein sequences (as above), 9 potential epitopes were
considered for removal.
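
The counts reported for the three chains can be tabulated as a simple consistency check. In the Python sketch below the numbers found, matched and selected are those quoted in the text. The intermediate values for the β and γ chains (85 and 48) and the overall total are derived here by arithmetic and are not stated in the specification.

```python
# Counts quoted in the text for each CVF chain:
# (motifs found, motifs matching germline immunoglobulin, selected for removal)
chain_counts = {
    "alpha": (638, 525, 15),
    "beta": (366, 281, 18),
    "gamma": (267, 219, 9),
}

remaining_after_ig = {}
for chain, (found, ig_matched, selected) in chain_counts.items():
    # Motifs left after the first (immunoglobulin) subtraction.
    remaining_after_ig[chain] = found - ig_matched
    print(f"{chain}: {remaining_after_ig[chain]} after Ig subtraction, "
          f"{selected} selected for removal")

total_selected = sum(selected for _, _, selected in chain_counts.values())
print("total selected:", total_selected)
```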

The net result of these processes was to identify those residues within the CVF
molecule which should be altered to eliminate potential MHC class II binding motifs.
Individual amino acids within the predicted binding motifs were selected for
alteration. With the object of maximising the likelihood of maintaining protein
functional activity, in all cases conservative amino acid substitutions were chosen at
any given site. In many cases, individual amino acids or even short strings (<5
consecutive residues) were substituted by reference to the published sequence of
human complement factor 3 (C3) (GenBank accession # K02765) which shows areas
of strong homology, including regions of identity, with CVF. In two instances,
potential epitopes were eliminated from the β chain by insertion of additional amino
acids. Amino acids were selected with reference to the human C3 protein.

Altered CVF α chain, β chain and γ chain sequences were compiled (Figures 2 to 4)
and further analysed by database comparison, as previously, for confirmation of 
successful elimination of potential MHC class II binding motifs. 






Method for construction of altered CVF molecules: 

PCR primers CVF1for (5'-ATAAGAATGCGGCCGCATGGAGAGGATGGCTCTCT) and
CVF1rev (5'-ATAAGAATGCGGCCGCTATCATTGATTCTTCTGAAC) were
used to amplify a 4985bp fragment from a λgt11 cobra venom gland cDNA library
(Fritzinger D.C. et al., ibid.). The PCR was conducted on a total library DNA
preparation using a high fidelity polymerase mix optimised for long distance PCR
(Advantage PCR Kit, Clontech, Basingstoke, UK) and conditions suggested by the
supplier. The PCR product was cloned into pcDNA3.1 (Invitrogen, Leek, The
Netherlands) as a NotI restriction fragment using standard techniques (Sambrook J.,
Fritsch E.F. & Maniatis T. (eds.) in: Molecular Cloning a Laboratory Manual, Cold
Spring Harbor Laboratory Press, NY, USA (1989)). The gene sequence was
confirmed to be identical to database entries using commercially available reagent
systems and instructions provided by the supplier (Amersham, Little Chalfont, UK).
Site directed mutagenesis was conducted using synthetic oligonucleotides and the
"quick-change" procedure and reagents from Stratagene (Cambridge, UK). Mutated
(de-immunised) versions of the gene were confirmed by sequencing. Mutated CVF
genes were transfected into CHO cells by electroporation. Stable transfectants were
selected using G418, and clones secreting active CVF were selected using a complement
consumption assay (Gowda D.C et al. (1994) ibid.; Cochrane C.G. et al (1970) ibid.).
Expressing clones were expanded and recombinant protein was purified from the
culture supernatant using sequential column chromatography and methods as
described (Vogel C-W et al., J. Immunological Methods 73 203-220 (1984)).
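
Because the PCR product is cloned as a NotI fragment, both primers are expected to carry the NotI recognition sequence (GCGGCCGC) near their 5' ends. The Python sketch below checks this for the two primer sequences quoted in the text. The helper names are hypothetical and the check is only an illustration, not part of the described method.

```python
# Recognition sequence for the enzyme named in the text.
SITES = {"NotI": "GCGGCCGC"}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def site_positions(primer, site):
    """Return 0-based start positions of a recognition site in a primer."""
    primer = primer.upper()
    return [i for i in range(len(primer) - len(site) + 1)
            if primer[i:i + len(site)] == site]


primers = {
    "CVF1for": "ATAAGAATGCGGCCGCATGGAGAGGATGGCTCTCT",
    "CVF1rev": "ATAAGAATGCGGCCGCTATCATTGATTCTTCTGAAC",
}

positions = {name: site_positions(seq, SITES["NotI"])
             for name, seq in primers.items()}
print(positions)

# NotI is palindromic, so the site reads the same on both strands.
print(reverse_complement(SITES["NotI"]) == SITES["NotI"])
```

In the forward primer the bases immediately after the NotI site are ATG, consistent with the start codon of the coding sequence.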



Example 2 

The present invention details a process whereby potentially immunogenic epitopes
within a non-autologous protein may be identified and offers methodology whereby
such epitopes may be eliminated. It is understood that there are a number of proven
therapeutic proteins whose therapeutic use is curtailed on account of their
immunogenicity in man. In the present example the therapeutic protein streptokinase
is analysed for the presence of potential MHC binding motifs and a method disclosed
for the removal of a number of these from the molecule.

Streptokinase (SK) is a single chain protein of approximate molecular weight 47kDa
that is produced by certain strains of β-haemolytic streptococci (Huang T.T. et al,
Mol. Biol. 2 197-205 (1989)). The protein has no inherent enzymatic activity but has
considerable clinical importance owing to its ability to efficiently bind human
plasminogen, potentiating its activation to plasmin and thereby promoting the
dissolution of fibrin filaments in blood clots. Several studies have shown that SK is an
effective thrombolytic agent in the treatment of coronary thrombosis, improving
survival (ISIS-2 Collaborative Group, Lancet 2 349-360 (1988)) and preserving left
ventricular function following myocardial infarction (ISAM Study Group, N. Engl. J.
Med. 314 1465-1471 (1986); Kennedy J.W. et al., Circulation 77 345-352 (1988)).
Despite the undoubted therapeutic value of SK, the non-autologous origin of the
protein is disadvantageous due to its immunogenicity in humans. The production of
neutralising antibodies in the patient generally limits the protein to a single use.

Method for identification of potential MHC class II binding motifs in
streptokinase:

The sequence of streptokinase was identified from the GenBank database. The
sequence with accession number S46536 was used throughout (Figure 5). The
sequence was analysed for the presence of potential MHC class II binding motifs by
computer aided comparison to a database of MHC-binding motifs as resident on world
wide web site http://wehih.wehi.edu.au/mhcpep/.

Results of the "searching" process indicate the presence of 395 potential MHC class II
binding motifs. Of these, 283 matched sequences identified in a database of human
germline immunoglobulin variable region protein sequences. These epitopes were not
considered further on the basis that immune responses in general are not mounted to
autologous circulating proteins such as immunoglobulins. This implies
immunological tolerance to the potential T-cell epitopes present in the structure of the
immunoglobulins (and indeed the majority of human proteins). Epitopes presented by
non-autologous proteins such as SK which are identical or similar to motifs present in
immunoglobulin proteins are likely also to be tolerated and in practice may be retained
through the de-immunisation process.

Following subtraction of the human immunoglobulin protein germline motifs, the
remaining 112 potential epitopes were analysed individually for similarity to
non-immunoglobulin protein sequences. In practice, predicted anchor residues for each
potential epitope were used in a consensus sequence search of human expressed
proteins. The SwissProt and GenBank translated sequence databases were
interrogated using commercially available software (DNAstar, Madison, WI, USA).
Epitopes identified in known circulating human proteins were not considered further
and were therefore allowed to remain unchanged within the SK molecule. An
example of one such rejected potential epitope is given by the sequence LLKAIQEQL
at positions 79-87 in the SK protein. This sequence represents a predicted consensus
binding motif for HLA-DR1*0101 with anchor residues L, A and L. Database
searching using the consensus sequence LxxAxxxxL identifies >4000 entries in a
human protein sub-set of the SwissProt database, including serum albumin protein
(SwissProt accession number P02768). An example of an epitope where no match to a
human protein considered to be in the general circulation was found is provided by the
sequence YVDVNTN at positions 299-305 in the SK protein. This sequence represents
a potential epitope for presentation by HLA-DR4*0401. Consensus sequence
searching identifies < 50 human proteins containing this motif, of which many are
intracellular proteins of differentiated tissues such as brain. These may be considered
as not generally available to the immune system to gain tolerance and therefore
identify this as a potential epitope for elimination according to the method of the
present invention. Similarly, a further potential HLA-DR1*0101 binding motif was
identified in the SK peptide sequence KADLLKAI at positions 76-83 of the SK
protein. This motif identifies < 150 human proteins in the same data set and was also
identified for modification by the method of the present invention.

The net result of these processes was to identify those residues within the SK molecule
which should be altered to eliminate potential MHC class II binding motifs.
Individual amino acids within the predicted binding motifs were selected for
alteration. With the object of maximising the likelihood of maintaining protein
functional activity, in all cases conservative amino acid substitutions were chosen at
any given site. A new (de-immunised) SK sequence was compiled (Figure 6) and
further analysed by database comparison, as previously, for confirmation of successful
elimination of potential MHC class II binding motifs.
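
The cycle of substituting a residue and then re-checking the altered sequence for the motif can be sketched as follows. This Python example is illustrative only. The fragment, the motif and the substitution F4L are hypothetical and do not correspond to any change shown in the figures, and the single-letter notation (original residue, 1-based position, new residue) is assumed.

```python
import re


def apply_substitution(sequence, mutation):
    """Apply a point substitution written as e.g. 'F4L' (1-based position)."""
    m = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if m is None:
        raise ValueError(f"unrecognised mutation: {mutation}")
    old, pos, new = m.group(1), int(m.group(2)), m.group(3)
    if not 1 <= pos <= len(sequence):
        raise ValueError(f"position {pos} outside sequence")
    if sequence[pos - 1] != old:
        raise ValueError(f"expected {old} at {pos}, found {sequence[pos - 1]}")
    return sequence[:pos - 1] + new + sequence[pos:]


def motif_present(consensus, sequence):
    """True if the consensus ('x' = any residue) occurs in the sequence."""
    pattern = consensus.replace("x", ".")
    return re.search(pattern, sequence) is not None


# Hypothetical fragment and motif, for illustration only.
wild_type = "MSTFAAAMGYKK"
altered = apply_substitution(wild_type, "F4L")

print(altered)
print(motif_present("FxxxMxY", wild_type))
print(motif_present("FxxxMxY", altered))
```

Checking the stated wild-type residue before substituting guards against numbering mistakes, for example counting from the precursor rather than the mature protein.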

Method for construction of de-immunised SK molecules: 

PCR primers SK1 (5'-ggaattcatgattgctggacctgagtggctg) and SK2
(5'-tggatccttatttgtcgttagggtatc) were used to amplify the wild-type SK gene from a strain
of Streptococcus equisimilis group C (ATCC accession number 9542). The resulting
1233bp fragment was cloned into pUC19 as a BamHI-EcoRI restriction fragment
using standard techniques (Sambrook J., Fritsch E.F. & Maniatis T. (eds.) in:
Molecular Cloning a Laboratory Manual, Cold Spring Harbor Laboratory Press, NY,
USA (1989)). The gene sequence was confirmed to be identical to database entries
using commercially available reagent systems and instructions provided by the
supplier (Amersham, Little Chalfont, UK). Site-directed mutagenesis was conducted
using synthetic oligonucleotides and the "quick-change" procedure and reagents from
Stratagene UK Ltd. Mutated (de-immunised) versions of the gene were confirmed by
sequencing. Mutated SK genes were sub-cloned as EcoRI-BamHI fragments into the
bacterial expression vector pEKG-3 (Estrada M.P. et al., Bio/Technology 10 1138-1142
(1992)) for expression of de-immunised SK. Recombinant protein was purified
using a plasminogen affinity column according to the method of Rodriguez et al.
(Rodriguez, P. et al., Biotechniques 7 638-641 (1992)). Fibrinolytic activity was
assessed using the casein/plasminogen plate technique and the in vitro clot lysis assay
as described by Estrada et al. (Estrada et al., ibid.).

Example 3

In the present example staphylokinase is analysed for the presence of potential MHC
binding motifs and a method disclosed for the removal of a number of these from the
molecule.

Staphylokinase protein from Staphylococcus aureus has recognised and well
characterised profibrinolytic properties. Recombinant forms of the protein have been
previously produced and in vitro and in vivo studies have indicated that the protein
holds considerable promise for thrombolytic therapy (Sako, T., Eur. J. Biochem. 149
557-563 (1985); Schlott, B. et al. Biotechnology 12 185-189 (1994); Collen, D. et al.
Circulation 94 197-462 (1996)). However, clinical use in humans has been limited
due to the demonstrated immunogenicity of the protein in man (Collen, D. et al,
Circulation 95 463-472 (1997)). Availability of a non-immunogenic staphylokinase
would have considerable importance as a potential agent for thrombolytic therapy.

The mature staphylokinase protein consists of a single polypeptide chain of 137 amino
acids with approximate molecular weight 15.4kDa (Silence K. et al, J Biol
Chemistry, 270 27192-27198 (1995)).

Method for identification of potential MHC class II binding motifs in
staphylokinase:

The sequence of staphylokinase (sakSTAR) protein as given in table 1 of Collen et al.
(Collen, D. et al. (1996) ibid.) was used throughout. The protein sequence of the
mature staphylokinase (Figure 7) was analysed for the presence of potential MHC
class II binding motifs. Analysis was conducted by computer aided comparison to a
database of MHC-binding motifs as resident on world wide web site
http://wehih.wehi.edu.au/mhcpep/.



Results of the "searching" process indicate the presence of 128 potential MHC class II
binding motifs. Of these, 91 matched sequences identified in a database of human
germline immunoglobulin variable region protein sequences. These epitopes were not
considered further. Epitopes presented by non-autologous proteins such as
staphylokinase which are identical or similar to motifs present in immunoglobulin
proteins are likely to be tolerated and in practice may be retained through the
de-immunisation process.

Following subtraction of the human immunoglobulin protein germline motifs, the
remaining 37 potential epitopes were analysed individually for similarity to
non-immunoglobulin protein sequences. In practice, predicted anchor residues for each
potential epitope were used in a consensus sequence search of human expressed
proteins. The SwissProt and GenBank translated sequence databases were
interrogated using commercially available software (DNAstar, Madison, WI, USA).
Epitopes identified in known circulating human proteins were not considered further
and were therefore allowed to remain unchanged within the protein.

The net result of these processes was to identify those residues within the
staphylokinase molecule which should be altered to eliminate potential MHC class II
binding motifs. Individual amino acids within the predicted binding motifs were
selected for alteration. With the object of maximising the likelihood of maintaining
protein functional activity, in all cases conservative amino acid substitutions were
chosen at any given site.

An altered staphylokinase sequence was compiled (Figure 8) and further analysed by
database comparison, as previously, for confirmation of successful elimination of
potential MHC class II binding motifs.



Method for construction of altered staphylokinase molecules:

A wild-type staphylokinase gene was synthesised under contract by Genosys
Biotechnologies Ltd (Cambridge, UK). The gene was constructed by PCR using long
(80-mer) overlapping synthetic primers and the sequence as given by Collen, D. et al.
(Collen, D. et al., (1996) ibid.). The synthetic gene was cloned as a 453 bp
EcoRI-HindIII restriction fragment into bacterial expression vector pMEX (MoBiTec,
Göttingen, Germany). The gene sequence was confirmed to be identical to database
entries using commercially available reagent systems and instructions provided by the
supplier (Amersham, Little Chalfont, UK). Altered (reduced immunogenicity)
versions of the gene were engineered using site directed mutagenesis of the wild-type
gene in pMEX. Short (18-mer) synthetic oligonucleotides and the "quick-change"
procedure and reagents from Stratagene (Cambridge, UK) were used to create the
variant genes. All variant gene sequences were confirmed by DNA sequencing.

Mutated staphylokinase genes were transformed into E. coli strain TG1 by standard
techniques. A single transformed clone was selected and clones secreting active
staphylokinase were selected using a fibrin plate assay (Astrup, T. et al., Arch.
Biochem. Biophys. 40: 346-351 (1952); Collen, D. et al. Fibrinolysis 6 203-213
(1992)). The best expressing clone was grown up and recombinant protein was
purified from the culture supernatant using sequential column chromatography and
methods as described previously (Collen, D. et al, (1992) ibid.; Schlott et al, ibid.).

Example 4 

In the present example bryodin 1 is analysed for the presence of potential MHC
binding motifs and a method disclosed for the removal of a number of these from the
molecule.

The gene for bryodin 1 protein has recently been cloned from Bryonia dioica, a
member of the Cucurbitaceae family of plants (Gawlak, S. et al. Biochemistry 36
3095-3103 (1997)). Bryodin 1 is a type 1 ribosome inactivating protein. Studies using
recombinant forms of the protein have indicated that bryodin 1 holds considerable
promise for immunotoxin therapy for cancer and other diseases (Gawlak, S. et al.
(1997) ibid.). However, clinical use in humans, as with other immunotoxin agents, is
likely to be curtailed due to immunogenicity of the protein in man. Availability of a
non-immunogenic bryodin would have considerable importance as a potential
component in immunotoxin based therapies.

The mature bryodin protein consists of a single polypeptide chain of 267 amino acids 
with approximate molecular weight 29kDa. The wild-type sequence is illustrated in 
Figure 9. 

Method for identification of potential MHC class II binding motifs in bryodin 1:

The sequence of bryodin 1 protein as given by Gawlak et al, (Gawlak, S. et al, (1997)
ibid.) was used throughout. The protein sequence of the mature bryodin 1 (Figure 9)
was analysed for the presence of potential MHC class II binding motifs. Analysis was
conducted by computer aided comparison to a database of MHC-binding motifs as
resident on world wide web site http://wehih.wehi.edu.au/mhcpep/.

Results of the "searching" process indicate the presence of 315 potential MHC class II 
binding motifs. Of these, 259 matched sequences identified in a database of human 
germline immunoglobulin variable region protein sequences. These epitopes were not
considered further.

Following subtraction of the human immunoglobulin protein germline motifs, the
remaining 56 potential epitopes were analysed individually for similarity to
non-immunoglobulin protein sequences. The predicted anchor residues for each potential
epitope were used in a consensus sequence search of human expressed proteins. The
SwissProt and GenBank translated sequence databases were interrogated using
commercially available software (DNAstar, Madison, WI, USA). Epitopes identified
in known circulating human proteins were not considered further and were therefore
allowed to remain unchanged within the protein.



The net result of these processes was to identify those residues within the bryodin 1 
molecule which should be altered to eliminate potential MHC class II binding motifs. 
For bryodin 1 protein, 13 potential epitopes were identified for elimination from the 
molecule. Individual amino acids within the predicted binding motifs were selected 
for alteration. With the object of maximising the likelihood of maintaining protein
functional activity, in all cases conservative amino acid substitutions were chosen at 
any given site. 

An altered bryodin 1 sequence was compiled (Figure 10) and further analysed by 
database comparison, as previously, for confirmation of successful elimination of
potential MHC class II binding motifs. 

Method for construction of altered bryodin 1 molecules: 

A wild-type bryodin 1 gene was synthesised under contract by Genosys
Biotechnologies Ltd (Cambridge, UK). The gene was constructed by PCR using long
(80-mer) overlapping synthetic primers and the sequence as given by Gawlak et al.,
(Gawlak, S. et al., (1997) ibid.). The synthetic gene was cloned as a 843bp
NcoI-EcoRI restriction fragment into a modified version of bacterial expression vector
pET22b+ (Novagen, Madison, USA). The vector was modified to remove the pelB
leader sequence which was previously shown to impede efficient expression of the
bryodin 1 protein (Gawlak, S. et al., (1997) ibid.). The modification was conducted by
digestion with XbaI and NcoI to remove 107bp of DNA encompassing the pelB leader
sequence. Elements of non-pelB sequence, including the ribosome binding site
sequence and the NcoI site, were restored in the vector by ligation of a linker molecule
to the XbaI/NcoI free ends in the vector. The linker was formed by annealing
complementary oligonucleotides:

Llf (5'-ctagaaataattttgtttaactttaagaaggagatatacatatgcc) and
Llr (5'-ccatggatatgtatatctccttcttaaagttaaacaaaattattt).






Oligonucleotides were supplied with phosphorylated ends by Genosys
Biotechnologies Ltd (Cambridge, UK). Restriction digests, DNA purification, ligation
reactions etc. were conducted using standard procedures and conditions recommended
by the reagent suppliers. Altered (reduced immunogenicity) versions of the gene were
engineered using site directed mutagenesis of the wild-type gene in the modified
pET22b vector. Short (18-mer) synthetic oligonucleotides and the "quick-change"
procedure and reagents from Stratagene (Cambridge, UK) were used to create the
variant genes. All variant gene sequences were confirmed by DNA sequencing.

Wild-type and mutated bryodin 1 gene variants were transformed into E. coli strain
TG1 by standard techniques. A single transformed clone was selected for each gene
and this clone used for sequence analysis. For expression work, bryodin 1 genes were
transformed into E. coli strain BL21(λDE3) obtained from the ATCC. Recombinant
bryodin 1 and variant bryodin 1 proteins were purified using methods described
previously (Gawlak, S. et al, (1997) ibid.). Following re-folding of the crude protein
from inclusion bodies, purified bryodin 1 and variants were obtained > 95% pure using
CM-Sepharose chromatography as previously (Gawlak, S. et al, (1997) ibid.).
Activity of the recombinant proteins was assessed using a cell-free protein synthesis
inhibition assay and methods according to Siegall et al., (Siegall, C.B. et al.,
Bioconjugate Chem. 5 423-429 (1994)).

Example 5 

In the present example the human interferon α2 protein is analysed for the presence of
potential MHC binding motifs and a method disclosed for the removal of a number of
these from the molecule.

Interferon alpha 2 (INA2) is an important glycoprotein cytokine expressed by
activated macrophages. The protein has antiviral activity and stimulates the
production of at least two enzymes, a protein kinase and an oligoadenylate synthetase,
on binding to the interferon alpha receptor in expressing cells. The mature INA2
protein is a single polypeptide of 165 amino acids produced by post-translational
processing of a 188 amino acid precursor protein by cleavage of a 23 amino acid
signal sequence from the amino terminus.

The protein has considerable clinical importance as a broad spectrum anti-viral,
anti-proliferative and immunomodulating agent. Recombinant and other preparations of
INA2 have been used therapeutically in a variety of cancer and viral indications in
man (reviewed in Sen, G.G. and Lengyel P, J. Biol. Chem. 267 5017-5020 (1992)).
However, despite very significant therapeutic benefit to large numbers of patients,
resistance to therapy in certain patients has been documented and one important
mechanism of resistance has been shown to be the development of neutralising
antibodies detectable in the serum of treated patients (Quesada, J.R. et al., J. Clin.
Oncology 3 1522-1528 (1985); Stein R.G. et al. New Eng. J. Med. 318 1409-1413
(1988); Russo, D. et al, Br. J. Haem. 94 300-305 (1996); Brooks M.G. et al. Gut 30
1116-1122 (1989)). An immune response in these patients is mounted to the
therapeutic interferon despite the fact that a molecule of at least identical primary
structure is produced endogenously in man. In the present example, an interferon
alpha 2 molecule with reduced potential immunogenicity in man is presented.

Method for identification of potential MHC class II binding motifs in human
interferon alpha 2:

The sequence of INF2 was identified from the GenBank database. The sequence with
accession number P01563 was used throughout. The protein sequence of the mature
INF2 protein was identified and the sequence excluding the first 23 amino acid signal
peptide analysed for the presence of potential MHC class II binding motifs (Figure
11). Analysis was conducted by computer using MPT ver1.0 software (Biovation,
Aberdeen, UK). This software package conducts "peptide threading" according to the
methods disclosed in WO-A-9859244. The software is able to provide an index of
potential peptide binding to 18 different MHC class II DR alleles covering greater than
96% of the HLA-DR allotypes extant in the human population.

Results of the "peptide threading" process on INF2 indicate the presence of a total 
of 18 individual potential epitopes. The epitopes map to five distinct clusters of 
overlapping epitopes encompassing residues 7-40 (cluster 1), residues 45-70 (cluster 
2), residues 79-103 (cluster 3), residues 108-132 (cluster 4) and residues 140-163 
(cluster 5). The five clusters contain 5, 3, 4, 3 and 3 potential epitopes 
respectively. 
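
The grouping of overlapping epitopes into clusters can be sketched as a simple interval merge. This is a minimal illustration, not the procedure used by the software; the function name is hypothetical, and the example intervals are only those cluster 1 and cluster 2 epitope positions quoted later in this example, not the full set of 18.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # (first residue, last residue), 1-based inclusive

def cluster_epitopes(epitopes: List[Interval]) -> List[List[Interval]]:
    """Group epitopes so that each cluster is a chain of overlapping intervals."""
    clusters: List[List[Interval]] = []
    cluster_end = 0
    for start, end in sorted(epitopes):
        if clusters and start <= cluster_end:
            # Overlaps the current cluster: extend it.
            clusters[-1].append((start, end))
            cluster_end = max(cluster_end, end)
        else:
            # No overlap: start a new cluster.
            clusters.append([(start, end)])
            cluster_end = end
    return clusters

positions = [(28, 41), (7, 19), (58, 70), (13, 25), (22, 34)]
clusters = cluster_epitopes(positions)
spans = [(c[0][0], max(end for _, end in c)) for c in clusters]

# Consistency check on the counts stated in the text.
cluster_counts = {1: 5, 2: 3, 3: 4, 4: 3, 5: 3}
total_epitopes = sum(cluster_counts.values())
```

With the five positions listed, the merge yields two groups, and the per-cluster counts given in the text sum to the stated total of 18.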

In order to prioritise epitopes for removal, the epitope clusters were then mapped to 
the known structure-function features on the molecule. For INF2, evidence from 
homology modelling (Murgolo N.J. et al., Proteins: Structure, Function & Genetics 
17 62-74 (1993)), site directed mutagenesis (Tymms, M.J. et al., Antiviral Res. 12 37-48 
(1989); McInnes B. et al., J. Interferon Res. 9 305-314 (1989)), cross-species 
chimaeric molecules (Raj, N.B.K. et al., J. Biol. Chem. 263 8943-8952 (1988); 
Shafferman, A. et al., J. Biol. Chem. 262 6227-6237 (1987)), deletion mutants (Wetzel, 
R. et al., In: Interferons New York Academic Press, pp819-823 (1982)) and 
serological mapping studies (Lydon N.B. et al., Biochemistry 24 4131-4141 (1985); 
Trotta P.P. et al., In: The Interferon System Dianzani F & Rossi G.B (eds.) New York, 
Raven Press, 1985 pp231-235) suggests that epitopes in clusters 1 and 2 are the highest 
priority targets for removal. With reference to the structural model of Murgolo et al. 
(Murgolo N.J. et al. (1993) ibid.), cluster 1, featuring 5 potential epitopes, encompasses 
the functionally important helix A and the AB surface loop region of the INF2 
molecule. This region is important for the antiviral activity and is involved in binding 
to the human INF2 receptor structure. Most significantly, epitopes for neutralising 
antibodies have been mapped to this region, specifically residues 10-11 of the helix A 
structural element (Lydon N.B. et al., (1985) ibid.). 

On this basis, cluster 1 epitopes at positions 7-19 and 13-25 were eliminated by 
substitution of threonine for leucine at position 15 (L15T using single letter codes). 
This procedure was conducted interactively in silico using the MPTver1 package. 
Similarly, cluster 1 epitope at position 28-41 was eliminated by substitution F27Y. 
This latter substitution had the concomitant effect of reducing the number of potential 
binding alleles for the overlapping epitope at position 22-34 from 13 to 11. 
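
The single-letter substitution codes used here (L15T, F27Y) can be applied mechanically to a sequence. The sketch below is a minimal illustration, assuming 1-based numbering of the mature protein; the function name is hypothetical and the short sequence is a made-up stand-in, not the interferon sequence of Figure 11.

```python
import re

def apply_substitution(sequence: str, code: str) -> str:
    """Apply a substitution such as 'L15T' (old residue, 1-based position, new residue)."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", code)
    if match is None:
        raise ValueError(f"not a substitution code: {code!r}")
    old, pos, new = match.group(1), int(match.group(2)), match.group(3)
    if not 1 <= pos <= len(sequence):
        raise ValueError(f"position {pos} is outside the sequence")
    if sequence[pos - 1] != old:
        # Guard against numbering errors before changing anything.
        raise ValueError(
            f"expected {old} at position {pos}, found {sequence[pos - 1]}"
        )
    return sequence[:pos - 1] + new + sequence[pos:]

# Made-up 30-residue stand-in with L at position 15 and F at position 27.
toy = "A" * 14 + "L" + "A" * 11 + "F" + "A" * 3
altered = apply_substitution(apply_substitution(toy, "L15T"), "F27Y")
```

Checking the expected wild-type residue before substituting catches off-by-one numbering errors, for example numbering that includes the signal peptide.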

Cluster 2 epitopes extend from the AB loop through the helix B region and into the 
BC loop domain. Neutralising antibodies have been shown to bind in the BC loop 
(Lydon N.B. et al., (1985) ibid.; Trotta P.P. et al., (1985) ibid.); therefore the cluster 2 
epitope at position 58-70 was also targeted for removal. Substitution N65L reduces 
potential immunogenicity in this region by eliminating 3 overlapping epitopes 
encompassing positions 61-77, which collectively are predicted to bind 5 different 
MHC DR alleles. Additionally, this substitution reduces the number of different 
binding alleles at epitope 58-70 from 5 to 3. 

Other epitope clusters (e.g. 3-5) map to either buried regions of the molecule or 
surface regions not involved in receptor binding and hence provide antigenic sites to 
which antibody responses are largely non-neutralising. 

Using the above method, an altered INF2 protein sequence was compiled containing 3 
substitutions from the starting sequence and is depicted in Figure 12. This sequence is 
predicted to be significantly less immunogenic with respect to human MHC class II 
presentation. The areas of reduced immunogenicity are focussed on regions of the 
molecule where antibody mediated neutralisation of the protein has been shown to 
occur, with the potential to limit the clinical efficacy of the molecule as a therapeutic 
entity. 
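
A compiled sequence can be checked against the starting sequence by listing the positions at which the two differ. The following is a minimal sketch under the assumption that both sequences have the same length (substitutions only, no insertions or deletions); the function name and the toy sequences are hypothetical.

```python
from typing import List

def list_substitutions(wild_type: str, altered: str) -> List[str]:
    """Return substitution codes (e.g. 'L15T') for every differing position."""
    if len(wild_type) != len(altered):
        raise ValueError("sequences must be the same length")
    return [
        f"{w}{i}{a}"
        for i, (w, a) in enumerate(zip(wild_type, altered), start=1)
        if w != a
    ]

# Made-up sequences differing at positions 4 and 8.
toy_wild_type = "MKTLLVAGS"
toy_altered = "MKTTLVAYS"
changes = list_substitutions(toy_wild_type, toy_altered)
```

Applied to a starting sequence and its altered counterpart, the length of the returned list gives the number of substitutions, which can be compared with the number stated in the text.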

Method for construction of altered interferon alpha 2 molecule: 

A wild-type INA2 gene was synthesised under contract by Genosys Biotechnologies 
Ltd (Cambridge, UK). The gene was constructed by PCR using long (80-mer) 
overlapping synthetic DNA primers and the sequence given in GenBank accession 
number M29883. The synthetic gene was cloned as a 520bp EcoRI-HindIII restriction 
fragment into bacterial expression vector pMEX8 (MoBiTec, Gottingen, Germany). 
The gene sequence was confirmed to be identical to the desired gene using 
commercially available reagent systems and instructions provided by the supplier 
(Amersham, Little Chalfont, UK). 

Altered (reduced immunogenicity) versions of the protein were constructed by site 
directed mutagenesis of the wild-type gene in pMEX8. Mutagenesis was conducted 
using short (18-mer) synthetic oligonucleotides obtained commercially (Genosys, 
Cambridge, UK) and the "quick-change" procedure and reagents from Stratagene 
(Cambridge, UK). Following site directed mutagenesis, DNA sequences of selected 
clones were confirmed as previously. 

For expression of recombinant wild-type and recombinant variant INA2 proteins, 
pMEX8-INA2 expression plasmids were transformed into E. coli strain JA221 and 
cells grown and harvested using standard procedures (Sambrook J., Fritsch E.F. & 
Maniatis T. (1989) ibid.). Recombinant INA2 was prepared essentially as described 
previously (Grosfeld, H. et al., in Advances in Biotechnological Processes 4, Mizrahi 
A. & Van Wezel A. L. eds., pp59-78 Alan R. Liss Inc. New York (1985)) with minor 
modifications. Briefly, following high speed centrifugation, the supernatant was 
concentrated by 60% saturation of ammonium sulphate and then chromatographed on 
a 2.7 x 68cm Sephadex G-75 column equilibrated with PBS plus 0.5M NaCl. 
Fractions (8ml) were collected at a 1ml/min flow rate. Pooled fractions were further 
purified using an immunoaffinity column as described previously (Grosfeld, H. et al., 
(1985) ibid.). Purified proteins were assayed using SDS-PAGE analysis and 
functional activity (anti-viral activity) determined using biological assays as described 
previously (Shafferman, A. et al., J. Biol. Chem. 262 6227-6237 (1987)). 
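
For orientation, the column dimensions and fraction size quoted above imply the following approximate figures. This is plain arithmetic on the stated values, assuming a cylindrical bed filling the full 2.7 x 68cm and a flow rate of 1 ml per minute; the variable names are illustrative.

```python
import math

diameter_cm = 2.7
height_cm = 68.0
fraction_ml = 8.0
flow_ml_per_min = 1.0  # assumed reading of the stated flow rate

# Volume of a cylinder; 1 cm^3 equals 1 ml.
bed_volume_ml = math.pi * (diameter_cm / 2) ** 2 * height_cm

minutes_per_fraction = fraction_ml / flow_ml_per_min
fractions_per_bed_volume = bed_volume_ml / fraction_ml

print(f"bed volume: {bed_volume_ml:.0f} ml")
print(f"time per fraction: {minutes_per_fraction:.0f} min")
print(f"fractions per bed volume: {fractions_per_bed_volume:.1f}")
```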




CLAIMS 



1. A method of rendering a protein, or part of a protein, non-immunogenic, or less 
immunogenic, to a given species, the method comprising: 

(a) determining at least part of the amino acid sequence of the protein; 

(b) identifying in the amino acid sequence one or more potential epitopes for T 
cells ("T cell epitopes") which are found in an endogenous protein of the given 
species; and 

(c) modifying the amino acid sequence to eliminate at least one of the T cell 
epitopes identified in step (b) thereby to reduce the immunogenicity of the protein or 
part thereof when exposed to the immune system of the given species. 

2. A method as claimed in claim 1 wherein the given species is human. 

3. A method according to claim 1 or 2, wherein the amino acid sequence is 
modified by modification of a peptide which binds to MHC class II molecules. 

4. A method according to claim 1 or 2, wherein the amino acid sequence is 
modified by modification of a peptide which binds to MHC class I molecules. 

5. A method as claimed in any one of claims 1 to 4, wherein the T cell epitopes 
are identified by computation. 

6. A method as claimed in any one of claims 1 to 5, comprising additionally 
identifying and eliminating one or more potential T cell epitopes which are not found 
in an endogenous protein of the given species. 






7. A method as claimed in any one of claims 1 to 6, wherein an epitope 
considered as not generally available to the immune system to gain tolerance is 
identified as a potential epitope for elimination. 

8. A method as claimed in claim 7, wherein an epitope of an intracellular protein 
is identified as a potential epitope for elimination. 

9. A method as claimed in any one of claims 1 to 8, wherein one or more B cell 
epitopes is also identified as a potential epitope for elimination. 

10. A method as claimed in any one of claims 1 to 9, wherein the protein is 
non-autologous. 

11. A method as claimed in any one of claims 1 to 9, wherein the protein is 
autologous. 



12. A method as claimed in any one of claims 1 to 11, wherein the protein is: a 
protein with an enzymatic activity which has a beneficial therapeutic effect; a protein 
which is used to convert inactive drugs to active drugs within a living organism; a 
protein which is used to vaccinate whereby one or more immunogenic epitope is 
undesirable; a protein which performs as a carrier of other molecules within the living 
organism; or a protein which binds to other molecules within or introduced within the 
living organism in order to alter the biodistribution of the other molecules. 

13. A method as claimed in any one of claims 1 to 12, wherein the protein is tested 
for desired activity. 



14. A method as claimed in claim 1 comprising: 

I. Determining the amino acid sequence of the protein or a part thereof (if 
modification only of a part is required); 

II. Identifying potential T cell epitopes within the amino acid sequence of the 
protein by any method including determination of the binding of peptides to 
MHC molecules, determination of the binding of peptide:MHC complexes to 
the T cell receptors from the species to receive the therapeutic protein, testing 
of the protein or peptide parts thereof using transgenic animals with the MHC 
molecules of the species to receive the therapeutic protein, or testing with 
transgenic animals reconstituted with immune system cells from the species to 
receive the therapeutic protein; 

III. By genetic engineering or other methods for producing modified proteins, 
altering the protein to remove one or more of the potential T cell epitopes and 
producing such an altered protein for testing; 

IV. (optionally) Within step III., altering the protein to remove one or more of the 
potential B cell epitopes; 

V. Testing altered proteins with one or more potential T cell epitopes (and 
optionally B cell epitopes) removed in order to identify a modified protein 
which has retained all or part of its desired activity but which has lost one or 
more T cell epitopes. 

15. A method as claimed in any one of claims 1 to 14, wherein a T cell epitope is 
eliminated by alteration of one or more amino acids within the epitope itself. 

16. A method as claimed in claim 15, wherein the alteration is amino acid 
substitution. 

17. A method as claimed in claim 16, wherein the amino acid substitution is made 
with reference to a homologous protein primary structure. 

18. A method as claimed in claim 16, wherein the amino acid substitution is made 
on the basis of similar size and/or charge. 

19. A method as claimed in claim 16, wherein the amino acid substitution is made 
with reference to in silico protein modelling techniques. 

20. A method as claimed in claim 13, wherein a high-throughput method is used 
for screening large numbers of modified proteins. 

21. A molecule resulting from a method as claimed in any one of claims 1 to 20. 

22. A molecule as claimed in claim 21 for use in medicine or diagnosis. 

23. The use of a molecule as claimed in claim 21 in the manufacture of a 
therapeutic or diagnostic agent. 






A 

α-chain 

ALYTLITPAVLRTDTEEQILVEAHGDSTPKQLDIFVHDFPRKQKTLFQTRVDMNPAGGMLVTP 
TIEIPAKEVSTDSRQNQYVWQVTGPQVRLEKWLLSYQSSFLFIQTDKGIYTPGSPVLYRVF 
SMDHNTSKMNlCmVEFQTPEGILVSSNSVDIOTFWPYNLPDLVSLGTWRIVAKYEHSPE^ 
AYFDWKYVLPSFEVRI^PSEKFFYIDGNENFHVSITARYLYGEEVEGVAFVLFGVKIDDAKK 
S I PDSLTRI PI IIXSIXJKATLKRDTFRSRFPNLNELVGHTLYASVTVMTESGSDMVVTBQSGIH 
IVASPYQIHFTKTPKYFKPGMPYELTVYVTNPDGSPAAHVPWSEAFHSMGTTLSDGTAKLIL 
NI PLNAQSLP I TVRTNHGDLPRERQATKSMTAIAYQTQGGSGNYLHVAITSTE I KPGDNLPVN 
FWnCGNANSLKQIKYFTYLIlJJKGKIFKVGRQPRRIXKJNLVTMNLHITPDLIPSFRFVAYYQV 
GmEIVADSVWVDVKDTCMGTLVVKGDNLIQMPGAAMKIKLEGDPGARVGLVAVDKAVYVLND 
KYKISQAKIWDTIEKSDFGCTAGSGQNNLGVFEDAGLALTTSTNLNTKQRSAAKCPQPAN 



B 

β-chain 

EIQMPTHKDLNLDITIELPDREVPIRYRINYENALLARTVETKLNQDITVTASGDGKATMTIL 
TFYNAQLQEKANVCNKFHLNVSVENIHLNAMGAKGALMLKI CTRYLGEVDSTMT I IDI SMLTG 
FLPDAEDLTRLSKGVDRYISRYEVDNNMAQKVAVIIYLNKVSHSEDECLHFKILKHFEVGFIQ 
PGSVKVYSYYNLDEKCTKFYHPDKGTGLLNKICIGNVCRCAGETCSSLNHQERIDVPLQIEKA 
CETNVDYVYKTKLLRIEEQIXShroiYVMDVLEVIKQGTDENPRAKTHQYISQRKCQEALNLKVN 
DDYLIWGSRSDLLPTKDKISYIITKNTWIERWPHEDECQEEEFQKLCDDFAQFSYTLTEFGCP 
T 



C 

γ-chain 

DDNEDGFIADSDI ISRSDFPKSWLWLTKDLTEEPNSQGl SSKTMSFYLRDSITTWWLAVSFT 
PTKGICVAEPYEIRVMKVFFIDI<»iPYSVVKNEQVEIRAILHNYVNEDIYVRVELLYNPAFCS 
ASTKGQRYRQQFPIKALSSRAVPFVIVPLEQGLHDfVEIKASVQEALWSDGVRKKLKVVPEGVQ 
KSIVTIViCU>PRARGVGGTQLBVlKARKLDDRVPDTBIETKI I IQGDPVAQI lENSIDGSKIJI 
SIPD 



FIGURE 1 

Protein sequence of mature cobra venom factor 
A = alpha chain 
B = beta chain 
C = gamma chain 






ALYTLITPAVLRTDTEEQILVEAHGDSTPKQLDIFVHDFPRKQKTLFQTRVDMNPAGGMLVTP 

TIEIPAKEVSTDSRQNQYVWQVTGTQVRLEKVVLLSYQSSFLFIQTDKGIYTPGSPVLYRVF 

SMDHNTSKMNKTVIVEFQTPEGILVSSNSVDSLNFFWPYNLPDLVSLGTWRIVAKYEHSPENY 

TAYFDVRKYVLPSFEVIVEPTEKFYYIDGNENFHVSITARYLYGEEVEGVAFVLFGVKIDDAK 

ISLPBSLIGIIPIITODGKATLSRDTFRSRFPNLNELVGKSLYVSATVITESGSDMVVTEQSGI 

HIVASPY0lHH'iaPKYFKPGMeYELlW\rrNPIX3SPAAHVPVVVEAra 

LNIPLNAQSLPITVRTNHGDLPRERQATKSMTAIAYQTQGGSGNYLHVAITSTEIKPGDNLPV 

NFNVKGNANSLAQIKYFTYLILNKGKIFKVGRQPREPGQDLVVLNLHITPDLIPSFRLVAYYT 

LIGASGNNEIVADSVWVDVKDSCVGSIVVKGDNLIQMPGAAMKIKLEGDPGARVGI.VAVDKAV 

FVUTOKYKISQAKIWDTVVEKADIGCTAGSGQNNKJVFEDAGLALTTSTNLNTKQRSA 

PAN 

FIGURE 2 

Protein sequence of an altered cobra venom factor α-chain. 



DDNEDGFIADSDIISRSDFPKSWLWLTKDLTEEPNSQGISSKTMSFYLKDSITTVTEVLAVSFT 
PTKGICVADPFEVTVMKVFFIDLQMPYSVVKNEQVEIRAILHNYVNEDLYVRVELLYNPAFCS 
ASTTGQRYRQQFPIKALSSRAVPFVIVPLEQGLHDVEVKAAVYHHFISIXrVRKKLKVVPEGVQ 
KSTVTIVKLDPRAKGVGGTQLEVIKARKLDDRVPDTEIETKIIIQGDPVAQIIENSIDGSKLN 
SIPD 

FIGURE 3 

Protein sequence of an altered cobra venom factor γ-chain. 



EIQMPTHKDI2nj)ITIELPDRBVPIRYRINYENASIJUlTVBTKIiNQDFTVTASGIX3I«VTOT 
TFYNAQICEKANVCNKFDLNVSVBNIHI^lAMGAKNTMILKICrRYLGEV^ IDISMI^TG 
FLPDABDLTRLSKG\7DmSRYBTONNMAQKmVIIYIJ)KVSHSEDDCTJIFO 
PGSVKVYSYYNLDESCTRFYHPDRCSTGLIjNKI CIGNLOICAGBTCSSLNHQERVDVPIjQIEKA 
CBTNVDYVYKTKLLRIBBQIXaroEYVMDVLEVIKQGTDBNORAKTHQyiSQ^ 
DHYLIWGSRSDIiLPTKDKI S YI IGKDTWVEHWPBEDECQEEE FQKLCDDFAQFSYTLTEFGCP 
T 



FIGURE 4 

Protein sequence of an altered cobra venom factor β-chain. 






lAGPEWLLDRPSVNNSQLWSyAGTVEGTNQDI SLKFFE IDLTSRPAHGGKTEQGLSPKS 
KPFATDSGAMPHKLEKADLLKAIQEQLIANVHSNDDYFEVIDFASDATITDRNGKVYFAD 
KDGSVTLPTQPVQEFLLSGHVRVRPYKEKPIQNQAKSVDVEYTVQFTPLNPDDDFRPGLK 
DTKLLKTLAIGDTITSQBLIAQAQSILNKTHPGYTIYBRDSSIVTHDNDIFRTILPMDQE 
FTYHVraniEQAYEINKKSGLNEEINNTDLISEKrYVLKKGEKPYDPFDRSHLKLFTIKY^ 
DVNTNELLKSEQLLTASERNLDFRDLYDPRDKRKLLYNNIjDAFGIMDYTLTGK\^ 
TNRI I TVYMGKRPEGENASYHLAYDKDRYTEEEREVySYLRYTGTPI PDNPNDK 

FIGURE 5 

Protein sequence of streptokinase from Streptococcus equisimilis 



IAGPEWLLDRPSVNNSQLWSVAGTVEGTNQDISLKFFEIDLTSRPAHGGKTEQGLSPKS 
KPFATDSGAMPHKLEKADLLKAKQEQLIANVHSNDDYFEVIDFASDATITDRNGKVYFAD 
KDGSVTLPTQPVQEFLLSGHVRVRPYKEKPIQNQAKSVDVEYTVQFTPIjNPDDDFRPGLK 
OTKLLKTIAIGDTITSQELLAQAQSirjJKTHPGYTIYBRDSSIVTHDNDIFRTILPMDQE 
FTYHVKNREQAYEINKKSGLNEEINNTDLISEKYYVLKKGEKPYDPFDRSHLKLFTIKFV 
DVNTNELLKSEQLLTASERNLDFRDLYDPRDKAKLLYNNLDAFGIMDYTLTGKVEDNHDD 
TNRIITVYMGKRPEGENASYHLAYDKDRYTEEERBVYSYLRYTGTPIPDNPNDK 

FIGURE 6 

Protein sequence of an altered streptokinase molecule. 



SSSFDKGKYKKGDDASYFEPTGPYLMVNVTGVDSKGNELLSPHAVEFPIKPGTTLTKEKIEYY 
VEWALDATAYKEFRVVEUDPSAKIEVTYYDKNKKKEErKSFPITEKGFWPDLSEHIKNPGFN 
LITKWIEKK 

FIGURE 7 

Protein sequence of mature staphylokinase. 



SSSFDKGKYKEGDDASQFBPTGPYUWNVTGVDSAGNALLSPHYVEFPIKPTTLTBERIKYY^ 
EWALDATAYAAFAVVELDPSARVEVTYYDIOJKKKEBTKSFPlTEKGFVVPDTSEHIKNPGFNIi 
FTKWTEKK 



FIGURE 8 

Protein sequence of an altered staphylokinase molecule. 






DVS FRLSGATTTS YGVF I KNLREALP YERKVYNI PLLRS S I SGSGRYTLLHLTNYADET I SVA 

VDVTNVYIhKSYIAGDVSYFFNEASATEAAKFVFKDAKKKVTLPYSGNYERIKJTAAGKIRE^ 

LGLPALDSAITTLYYYTASSAASALLVLIQSTAESARYKFIEQQIGKRVDKTFLPSLATISLE 

NNWSALSKQIQIASTNNGQFESPWLIDGNNQRVSITNASARWTSNIALLLNRNNIAAIGED 

ISMTLIGFEHGLYGI 

FIGURE 9 

Protein sequence of wild-type bryodin 1. 



DVSFSMSGATTTSYGVFVKNLREALPFERKVYNI PLLRSS I SGSGRYTLLHLTNYADETI SVA 
VDVTNVYIMGYLAGDVSYFFNEASATEAAKFVFKDAKKKVTLPYSGNHERLQTAAGKIRENIP 
LGLPALDSAITTLYYYTASSAASALLVLIQSTAESARFKFIEQQIGKRVDKTFLPSLATISLE 
NNWSALSKQIQIASTNNGQFESPWLVDGNNQSVSITNASARWTSNVALLLNRNNIAAVGED 
ISMTLIGFEHGLYGI 

FIGURE 10 

Protein sequence of an altered bryodin 1 molecule. 



CTLPQTHSLGSRRTIMIJAQMRKISLFSCLKDRHDFGFPQEEF^QFQKAETIPVLHEMIQQI 
FNLFSTKDSSAAWDBTLLDKFYTELYQQLNDLKACVIQGVG\nOTPLMKEDSILAVRKYFQRI 
TLYLKEKKYSPCAWBWRAEIMRSFSLSTNLQESLRSKE 

FIGURE 11 

Protein sequence of mature human interferon alpha 2 



a)LPQTHSLGSRRTTMLIAa^RiaSLFYCLKDRHDFGFPQEEFGNQFQKAETIP\nJI^ 

FLLPSTIOJSSAAWDETLIJJKFYTELYQQIJroLEAC^QGVGVTETPLMKEDSIIAVRK^ 

TLYLKEKKYSPCAWEWRAEIMRSFSIiSTNLOESLRSKB 



FIGURE 12 

Protein sequence of an altered human interferon alpha 2 molecule 



